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(54) A method for translating a source operation to a target operation, and computer program for 
the method 

(57) A method is provided for translating a source 
operation to a target operation 4. The source operation 
acts on one or more source operands, each comprising 
a binary integer of a first bit-width. The target operation 
is required to be evaluated by a processor, such as a 
computer, which performs integer operations on binary 
integers of a second bit-width which is greater than first 
bit-width. The source operation 1 is translated to a tar- 
get operation having at least one target operand. Steps 
3 and 5 identify whether the value of unused bits of the 
or each target operand affects the value of the target 
operation and whether the target operand or any of the 
target operands, is capable of having one or more 
unused bits of inappropriate value. If so, a correcting 
operation is added 6 to the target operation for correct' 
ing the value of each of the bits of inappropriate value 
before performing the target operation. 
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Description 

[0001] The present invention relates to a method of translating a source operation to a target operation. For exam- 
ple, the method may be used to permit efficient computation with integer values whose bit-width is not a multiple of a 
5 native integer bit-width of a target machine, such as a computer. Another application of such a method is in the simula- 
tion of high-level hardware description languages. The invention also relates to a computer program for performing such 
a method, a storage medium containing such a program, a computer programmed by such a program and a translation 
produced by such a method. 

[0002] Hardware description languages such as VHDL (IEEE Computer Society. IEEE Standard VHDL Language 
10 Reference Manual. New York, USA. June 1 994. IEEE Std 1 076 1 993.) and Verilog HDL (IEEE Computer Society. IEEE 
Standard Hardware Description Language Based on the Verilog Hardware Description Language. New York, USA. 
1 996. 1 EEE Std 1 364 1 995.) can be used to describe the behaviour of hardware circuits. These languages support the 
description of circuits involving integer arithmetic by supplying appropriate numeric types and operators. In order to 
design hardware efficiently, the numeric types are usually parameterised by the bit-widths of the integer values of the 
is type. 

[0003] For instance, the IEEE 1 076.3 NUMERIC.STD VHDL library (IEEE Computer Society. IEEE Standard Hard- 
ware Description Language Based on the Verilog Hardware Description language, New York, USA, 1996. IEEE Std 
1364 1 995) defines the UNSIGNED and SIGNED integertypes based on the type of bit vectors STD_ LOGIC VECTOR. 
Bit vectors are fixed-length lists of bit values which include the values 0 and 1 as well as other states such as U (unini- 
20 tialised) and 2 (high impedance). Although the length of the bit vectors (and therefore of the numeric types) is fixed, one 
can use several types of bit vectors (each having different lengths) in a hardware design. Note that since the individual 
bits of a VHDL numeric type can have values other than 0 and 1, a numeric type contains several values that do not 
correspond to valid integers. 

[0004] High-level hardware description languages, such as the Bach language (Akihisa Yamada, Koichi Nishida, 
25 Ryoji Sakurai, Andrew Kay, Toshio Nomura and Takashi Kambe. Hardware synthesis with the BACH system. Interna- 
tional Symposium on Circuits and Systems, 1 999, GB2 31 7 245), can be used to describe hardware at a level of 
abstraction usually used in programming languages such as C (Brian W. Kemihan and Dennis M. Ritchie. The C Pro- 
gramming Language, Prentice-Hall, USA, second edition, 1988). 

[0005] Since the C language, and programming languages in general, are designed to be used to program some 

30 particular architecture, the semantics of their integer types and expressions usually depend on the target architecture 
being programmed. For example, the C language (Brian W. Kemihan and Dennis M. Ritchie. The C Programming Lan- 
guage, Prentice-Hall, USA, second edition, 1988) supplies the int and unsigned int types which correspond to signed 
and unsigned integers whose bit-width is naturally supported by the target machine. The C integer types are too inflex- 
ible to be used for the efficient design of hardware and therefore the Bach hardware description language extends C by 

as exact width integer types, such as intw and unsigned intw for signed and unsigned w-bit integers, respectively. 

[0006] One of the many advantages of using a high-level, programming-language based, hardware description lan- 
guage like Bach over using a lower level one such as VHDL is the ability to simulate (on a general purpose architecture) 
the behaviour of designs at much higher speeds. For example, because of the particular level of abstraction of the 
VHDL numeric types described earlier, one cannot use the integer arithmetic of the target machine used for simulation 

40 directly. In general, one would have to represent every bit value of a numeric type as a separate byte/word on the target 
machine and therefore the simulation of VHDL arithmetic operations can be much slower than the native arithmetic of 
the target machine. Although Bach arithmetic is more similar to the native arithmetic of a target machine than VHDL 
numeric arithmetic is, one still cannot use the native target machine arithmetic naively because of the difference in the 
bit-widths. One therefore needs to develop methods for the correct and efficient simulation of the exact width of the 

45 arithmetic usually used in hardware descriptions. 

[0007] Retargetable compilers, such as the Valen-C compiler (Graduate School of Information Science and Electri- 
cal Engineering (Kyushu University), Advanced Software Technology and Mechatronics Research Institute of Kyoto, 
Institute of Systems and Information Technologies (Kyushu) and Mitsubishi Electric Corporation. Valen-C Compiler 
Development Document (English Version 1.01). Information Technology promotion Agency, Japan, April 1998), can be 

so used to compile an input source program to the assembly/machine language of different target machines. The input lan- 
guage of the Valen-C compiler is the Valen-C programming language which also extends the ANSI C language by exact 
wicfth integer types. The programmer can specify integer types of the form intw where w is any non-zero natural number 
representing the width of the type. However, the semantics of the Valen-C integer arithmetic is still dependent on the 
target architecture and an integer type intw is actually defined as being at least w bits long and its width is a multiple of 

55 the word size of the target machine. As a result, the behaviour of the compiled code differs from one target machine to 
another. 

[0008] De Coster et al t "Code generation for compiled bit-time simulation of DSP applications", 1 1 th International 
Symposium on System Synthesis (ISSS '98) 2-4 December 1998, Hsinch, Taiwan (IEEE Computer Society Publica- 
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tions) pp 9-14 disclose a method intended for translating, fixed point integer arithmetic into standard integer arithmetic 
for the efficient simulation of digital signal processing (DSP) applications. However, no attempt is made to minimise 
value correcting operations in the resulting integer arithmetic expressions, which thus make inefficient use of target 
machine resources. 

[0009] EP 0 626 640 and EP 0 626 641 disclose techniques for converting high level processing languages into 
machine language. These techniques require the intervention of a user to assess whether the value of an operation is 
likely to overflow when a program is executed by a computer. If so, a value correction operation is performed and may 
comprise a masking operation or a sign-extending operation depending on whether the operation is unsigned or signed. 
[0010] According to a first aspect of the invention, there is provided a method of translating a source operation on 
at least one source operand comprising a binary integer of a first bit-width to a corresponding target operation for eval- 
uation by a processor which performs integer operations on binary integers of a second bit-width which is greater than 
the first bit-width, the method comprising: translating the source operation to a target operation having at least one tar- 
get operand; identifying whether the value of unused bits of the or each target operand affects the value of the target 
operation and whether the target operand or any of the target operands is capable of having one or more unused bits 
of inappropriate value; and, if so, adding to the target operation a correcting operation for correcting the value of each 
of the one or more bits of inappropriate value before performing the target operation. 
[001 1 ] The at least one source operand may comprise at least one constant. 
[0012] The at least one source operand may comprise at least one variable. 
[0013] The at least one source operand may comprise at least one sub-operation. 

[0014] The source operation may comprise an arithmetic operation. The identifying step may comprise identifying 
an unsigned unmasked target operand and the correcting step may comprise a masking step. The first bit-width may 
be w bits and the masking step may comprise performing a bit-wise AND operation between the unsigned unmasked 
target operand and a binary representation of (2 W -1 ). The identifying step may comprise identifying a signed non-sign- 
extended target operand and the correcting step may comprise a sign-extending step. 

[0015] The source operation may comprise a relational operation and the correcting step may comprise a monot- 
onic operation. 

[0016] According to a second aspect of the invention, there is provided a computer program for performing a 
method according to the first aspect of the invention. 

[001 7] According to as third aspect of the invention, there is provided a storage medium containing a computer pro- 
gram according to the second aspect of the invention. 

[0018] According to a fourth aspect of the invention, there is provided a computer programmed by a program 
according to the second aspect of the invention. 

[0019] According to a fifth aspect of the invention, there is provided a translation produced by a method according 
to a first aspect of the invention. 

[0020] The term "unused bit" of a target operand refers to any bit thereof which is not used to represent the value 
of a corresponding source operand. 

[0021] The term "inappropriate value" of an unused bit refers to a value of an unused bit which, if present in the tar- 
get operand, will result in the target operation being evaluated to a value different from that of the corresponding source 
operation. 

[0022] ft is thus possible to provide a method for the efficient computation of arithmetic and relational expressions 
involving integers of any bit-widths using a given device which is only capable of computing integer arithmetic expres- 
sions of certain bit-widths. 

[0023] For a given device or mechanism, called the target machine, which is capable of computing integer arithme- 
tic of some specified widths, say, w 1? w 2 , etc., such a method translates arithmetic expressions involving integers of any 
width into efficient arithmetic expressions involving integers of the specified widths. The language of expressions involv- 
ing integer arithmetic of any width is referred to as the source language. Similarly, the language of expressions involving 
integers of the given widths , w 2 , etc., is referred to as the target language. Using this method: 

• An integer atomic expression (such as a constant or variable) having width w is represented by an atomic expres- 
sion whose width is that of the narrowest target language integer that is at least w bits wide. For example, an 
unsigned atomic expression of width 5 is represented by an atomic expression of width 16 if the target language 
contains integers of width 1 6, 32, 48, etc. 

20 (U5) 20 (U16) 

• The notation x^j^ denotes the fact that the expression has an unsigned integer type of width w and x -> y denotes 
the fact that the source expression x is translated into the target expression y. 
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• Arithmetic operations in the target language are used whenever possible. For example: 

a (U5) + b (U5) ~* B (U16) + b (U16) 

• Because of the different widths in the source language and target language, the values of the target operations may 
differ from those in the source language: 

20 (U5) + 30 (U5) = 1 8 <U5) "to** 20 (U16) + 30 (1 6) = 50 (16) 

and therefore value correcting operations may be required in order to ensure that the value of operations in the tar- 
get expression is the same as those in the source expression when this is required: 

a (U5) + b (U5) ( a (U16) + D (U16)) & 31 

where & is the bitwise AND operator. The operation x <u16) & 31 (U16) corrects the 16 bit value stored in the expres- 
sion X(ui6) assuming that it represents a 5 bit unsigned value. 

[0024] Value correcting operations are not applied to every sub-expression that is translated, but only when such 
operations are required. For example, the expression: 

( a <U5) + b (U5)) * C (U5)) / ( d (U5) + e (U5) ) 

is not naively translated into: 

(((( a (Ui6) + b (ui6)) & 31 (ui6)) * C (U16)) & 31 (U16) ) / ((d^gj + e^^) & 31 (U16) ) & 31 (U16) ) 
which uses 8 target machine operations, but into the more efficient 

((( a (U16) + b (U16)) * C (U16)) & 31 (U16)) 1 M d (U16) + e (U16)) & 31 (U16)) 

which uses 6 operations. 

• This is achieved by using an internal representation of the target expressions (stored in memory locations, or reg- 
isters, on the machine used to translate source expression representations into target machine representations) 
which includes 

1 . the representation of the target machine expression; 

2. together with the following additional information: the sign of the expression, the width of the original expres- 
sion (called the significant width), the width of the target expression (called the representative width), and some 
information on the possible values of the bits in the integer value of the target expressions that are not used to 
store the value of the source expression (called the high-bit properties). The bits in the target expression values 
that are not used to store the value of the source expression are called the unused bits. 

[0025] This information is called the additional type information. Of particular importance is the high-bit properties 
information which states whether the following two properties are satisfied: 

- the masked property which states that all the unused bits are 0. 

the sign-extended property which states that all the unused bits are the same as the most-significant bit of the value 
of the source expression. 

[0026] These two properties are significant because, 

if the expression is unsigned, then the value of the source expression is the same as the value of the target expres- 
sion if and only if the target expression has the masked property. 

if the expression is signed, then the value of the source expression is the same as the value of the target expression 
if and only if the target expression has the sign-extended property. 

[0027] By using additional memory locations to store the additional type information (apart from the memory I oca- 
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tions required to store the representation of the target machine expression), such a translation method can insert the 
appropriate value correcting operations in target expression only when: 

1 . the value of the target expression is different from that of the source expression (that is, when the expression is 
5 unsigned and does not have the masked property, or when it is signed and does not have the sign-extended prop- 
erty), and 

2. the operation being translated requires the correct target expression values of its operands. Certain operations, 
such as + and * do not require the correct value for their target operands as long as the values in the significant bits 
are the same as the values in the bits of the source expression. As a result their operands do not need to be value- 

w corrected when the target expression is constructed. 

Other operations, such as \ require correct values for their operands, and therefore they have to be value corrected 
if their appropriate high-bit property (masked if the expression is unsigned, sign-extended otherwise) is not set. 
The result of certain operations (such as V but not + or *) is correct and therefore the appropriate high-bit property 
can be set in the additional type information of the target expression so that no value correction on the resulting 

is expression is required later. 

[0028] Since value-correcting operations are not inserted naively in all operations, the resulting target expression 
is more efficient 

[0029] Such a method can be used when arithmetic involving integers of any bit-widths needs to be computed on 
20 a processor or any device which can efficiently compute arithmetic operations of certain bit-widths only. Industrial appli- 
cations of such a method include: 

• The efficient simulation of high-level descriptions (using Bach, or otherwise) of arithmetic circuits on some specified 
target machine. Hardware is described in the Bach high-level language, and a low-level synthesizable industrial- 
's strength hardware description is generated automatically. Since the hardware designer uses a high-level language 

instead of a lower level one (such as VHDL), the design process is much quicker, and therefore cheaper, than tra- 
ditional hardware design processes. The present method may be used in the simulation phase of the Bach design 
flow, where the Bach hardware description is validated before the actual hardware is synthesised. This makes the 
simulation phase much faster than the use of naive methods, therefore reducing the time spent in hardware design. 

30 

• The efficient simulation or compilation of arithmetic operations targeted to a machine whose native integer arithme- 
tic is different from the one used for simulation. For example, a program targetted to a particular processor (which 
may be slow or not readily available during the implementation stage) can be simulated on a faster, readily available 
processor whose architecture is different from the target machine architecture. 

35 

• The efficient simulation of the designs of embedded systems where the efficient simulation of both hardware and 
software is required. 

[0030] Advantages which are readily achievable include: 

40 

• The target expressions generated by such a method are efficient since the native integer arithmetic of the target 
machine is used directly and the number of value correcting operations is reduced. 

• The translation method itself is not tune/space expensive and can therefore be used in applications where a quick 
45 compilation of the source expressions into the target expressions is required. 

[0031] The present invention will be further described, by way of example, with reference to the accompanying 
drawings, in which: 

so Figure 1 is a flow diagram illustrating a translation method constituting an embodiment of the invention; 

Figure 2 is a block schematic diagram of a computer for performing the method illustrated in Figure 1 ; 

Figure 3 is a schematic flow diagram illustrating a translation method constituting another embodiment of the inven- 
55 tion; 

Figure 4 is a diagram illustrating the bit structure of a source integer and of a target integer to which it is translated; 
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Figure 5 is a diagram illustrating an internal data structure for representing target machine expressions used in the 
method shown in Figure 3; 

Figure 6 is a high-level flow diagram illustrating transformation of a compound source expression to a target expres- 
sion; and 

Figure 7 is a schematic flow diagram illustrating use of the method of Figure 3 in the design and implementation of 
hardware in the form of an integrated circuit. 

[0032] Like reference numerals refer to tike parts throughout the drawings. 

[0033] Figure 1 illustrates a method of translating a source operation to a target operation. The source operation is 
performed on binary integers of a first bit-width and may comprise an arithmetic operation or a relational operation. The 
target operation is to be performed by a device such as a computer which performs arithmetic and/or logic operations 
on integers with a second bit-width which is greater than the first bit-width. 

[0034] A step 1 inputs a source operation and a step 2 translates the source operation to a corresponding target 
operation. A step 3 detects whether any unused bits of a target operand can affect the target operation. The target oper- 
ation is, in this sense, "affected" if the presence of any unused bits may result in the evaluation of the target operation 
being different from the evaluation of the corresponding source operation. If none of the unused bits can affect the target 
operation, the target operation is output at a step 4 for subsequent processing by the target machine. 
[0035] If the unused bits can affect the target operation, a step 5 detects whether any target operand has inappro- 
priate unused bits. For example, it may be that the or each operand of the target operation has unused bits which are 
already set to values such that evaluation of the target operation will not be affected. In this case, control passes to the 
step 4 which outputs the target operation. However, if the or any target operand has inappropriate unused bits, a step 
6 adds a correcting operation to the target operation before outputting the target operation at the step 4. It can thus be 
ensured that, when the target operation is performed on the target machine, the resulting evaluation will be the same 
as that of the source operation. 

[0036] Figure 2 illustrates a computer which may be used to perform the translation method illustrated in Figure 1 . 
The computer comprises a microprocessor 10 which includes an arithmetic and logic unit 11 . The microprocessor is 
provided with an input interface 1 2 for receiving source operations for translation and an output interface 1 3 for supply- 
ing the resulting translations in the form of target operations. Tne computer has a read-only memory 14 containing a 
program for controlling the microprocessor 10 to perform the method illustrated in Figure 1. The computer also has a 
random access memory 15 for temporarily storing values which occur during operation of the microprocessor 10. 
[0037] The method illustrated in Figure 1 is a very basic embodiment of the invention. More sophisticated methods 
are described hereinafter in greater detail. The following methods perform translations of "expressions" which are 
equivalent to the "operation" described with reference to Figure 1. 

[0038] Arithmetic expressions evaluate, or compute, values, and we consider the set of values which contains: 

* unsigned binary integers of any bit-width; 

• signed binary integers of any bit-width. 

[0039] We assume the standard unsigned binary interpretation for unsigned integers, for example the unsigned 
binary numeral 1101 is interpreted as the integer 13; and we assume the two's complement interpretation for signed 
integers, for example the signed binary numeral 1 101 is interpreted as the integer -3. 

[0040] In order to differentiate between the different types of values expressions can evaluate to, we assume that 
all expressions are typed by a term of the form: 

♦ unsigned©^ or simply Uc* for unsigned integer values; or 

* signed^ or simply S(o> for signed integer values; 

where the width co is a non-zero positive integer representing the number of bits in the value. We usually write e^ to 
denote the feet that e has type t. The type of an expression is usually omitted when it can be inferred from the context, 
or when it is not relevant to the discussion. 
[0041] Expressions can be: 

atomic such as constants and variables, or 

compound which consist of an operator, op say, applied to a number of expressions e v e^-.-.e,,, called the oper- 
ands: 
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op(e v e 2 ...,e n ) 

[0042] Constants evaluate to a fixed vaJue (hence their name), while the value of variables is given by an assign- 
ment a mapping from variables to values of their type. The valuation of a compound expression depends on the oper- 
s ator and the evaluation of its operands. The evaluation of an expression can have side-effects: that is, it can change the 
assignment of the variables. For example, given the initial assignment: 

X (S16) ~* 10 (S16) 

10 ^(Sie) ~* 20 (S16) 

and assuming the C semantics for the operators =, +, * and, then the evaluation of: 

( X (S16) = X (S16) + 1 » y(S16) ~ X (S16) * /(S16)) 

returns the value 220( S16 j and modifies the assignment to: 



15 



X (S16) ~* 11 (SI 6) 

20 y<S16) ~ 220 (S16) 

[0043] The aim is to be able to compute expressions involving arithmetic sub-expressions of any bit-width using 
some target machine which is capable of efficiently computing arithmetic expressions involving certain bit-widths: o>|, 
cos,.... only. Existing methods (see for instance Giles Brassard and Paul Bratley. Atgorithmics: Theory and Practice. 

25 Prentice-hall, USA, 1998) can be used to compute arithmetic operations on the target machine efficiently on integers 
whose width is a multiple of one of the specified widths. Using these methods, we can assume that the given target 
machine can efficiently compute integer arithmetic involving an infinite number of widths: 

o>i,(02 2a> 1j 2a) 2 3(^,3(0^ etc. We refer to these bit-widths as the native bit-widths of the target machine. We 

refer to the set of expressions that can be computed directly by the target machine as the target language defined more 

30 formally by the following statement The expression e is in the target language if and only if the widths of all the sub- 
expressions of e are native bit-widths. We also define the source language as the set of expressions of any bit-width. 
[0044] Figure 3 gives an overview of the method by which expressions 20 in the source language can be computed 
by the given target machine 21 . The core of this method is the translation mechanism 22 which takes a type correct 
source expression 23 and returns an internal representation 24 of the target expression. An expression is type correct 

as if it obeys a number of language-dependent typing rules such as, for instance, that the types of the operands of an arith- 
metic binary operation are the same as the type of the value returned by the operation. We can therefore assume that 
the source expression is first type-converted into a type correct expression. The type-conversion mechanism depends 
on the semantics of the source language and is not discussed here (see Appendix A6 of (Brian W. Kernighan and Den- 
nis M. Ritchie. The C Programming Language. Prentice-Hall, USA. second edition. 1 988) for the type-conversion mech- 

40 an ism of C expressions). Given a type correct source expression, the translation mechanism 22 returns an internal 
representation 24 of the target expression from which a target machine readable expression 25 can be extracted and 
computed. The mechanism for extracting the machine readable expression from the internal representation of the tar- 
get expression depends on the target machine used and is not discussed here. 

[0045] Figure 4 illustrates how a source integer value can be represented by a target integer value. Basically, an n- 
45 bit integer is represented by an m-bit target integer of the same sign, where m is the smallest native bit-width that is 
greater than or equal to n. The source integer bits 26 are stored in the lowest n-bits 27 of the target integer. The highest 
(m - n) bits 28 in the target integer value are called the unused bits. The lowest n bits 27 of the target integer value are 
called the significant bit-string. In order that the interpretation of the target value is the same as that of the source value: 

so • if the source integer value is unsigned, then the target integer value is masked: the unused bits in the target integer 
value are set to 0. 

• if the source integer value is signed, then the target integer value is sign-extended: the unused bits of the target 
integer value are set to the highest bit in the significant bit-string. 

55 



[0046] The internal representation of the target language expressions is illustrated in Rgure 5 and includes: 
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• the external representation 29 of the target expression, which depends on the target machine; as well as 

• some type information 30 which is used by the translation process in order to generate efficient target expressions. 

5 [0047] We use the notation </)f to denote the internal representation of the target expression f with type information 
/. The internal representation of the target expression should have an extraction operation which returns a target 
machine readable representation of the expression. 

[0048] The type information 30 is a data structure which has the following functions: 

io sign: The sign of the expression which can be either signed or unsigned. 

significant width: The significant width of a target expression is the width of the original expression it represents. 

representative width: The representative width of a target expression is its actual bit-width on the target machine. 
is The representative width of an integer expression is greater than or equal to its significant width. The type of an 
expression is said to be precise if its representative width is the same as its significant width. The bits in an impre- 
cise integer representation that are higher than its significant width are called the unused bits. 

high-bit values: The high-bit values, or high-bit properties of an imprecise expression give some information on its 
20 unused bits. The high-bit properties of an expression are: 

masked: A target expression is given the masked property if, for every assignment, it always evaluates to a 
value whose unused bits are all set to 0. 

25 sign-extended: A target expression is given the sign-extended property if, for every assignment, it always 

evaluates to a value whose unused bits have the same value as the highest bit in the significant bit-string. 

[0049] The properties are not mutually exclusive, as shown in the following example which lists some integer values 
with representative width 1 6 and significant width 9: 

30 



40 



Integer Value (in bits) 


High-Bit Properties 


unused bits 


significant bits 


masked 


sign-extended 


0000000 


101011011 


yes 


no 


1111111 


101011011 


no 


yes 


0010001 


101011011 


no 


no 


0000000 


010101101 


yes 


yes 


1111111 


010101101 


no 


no 



45 [0050] A target expression is said to be faithful if, for every assignment, it evaluates to a value whose interpretation 
is the same as that of its significant bit-string (assuming the same sign). Therefore, an expression is faithful rf: 

• it is unsigned and has the masked property, or 

• it is signed and has the sign-extended property. 

so 

[0051] it is sometimes convenient that every sub-expression of a target expression is also annotated with the type 
information because: 

• the sign and representative width of the expression may be required by the operation which extracts a machine 
as readable representation of the target expression; 

• one can develop optimisations on the internal representation of the target expressions which make use of the sig- 
nificant width and high-bit properties. 
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[0052] We can also generalise the high-bit values of the expression to give us the extent of the masking and sign- 
extension of the expression by using the following two values: 

masking extent A target expression e has a masking extent of n, where n is smaller than the representative width 
5 of e, if for every assignment it always evaluates to a value whose bits in position greater than or equal to n are 0. 

00000001 0101 0001 1 has a masking extent of 1 0. 
11111 0000001 01 01 0 has a masking extent of 1 7. 

io sign-extension extent: A target expression e has a sign-extension extent of n, where n is smaller than the repre- 
sentative width of e and greater than 0, if for every assignment it always evaluates to a value whose bits in position 
greater than or equal to n are equal to bit n-1 . 

00000001 0101 0001 1 has a sign-extension extent of 1 1 . 
is 11111 0000001 01 01 0 has a sign-extension extent of 1 3. 

[0053] Note that we are using the convention that the least significant bit of a bit-string is in position 0. 
[0054] Representations of integer expressions can be made faithful by applying value correcting operations which 
guarantee that the values evaluated by the expression are masked or sign-extended as required. We assume that the 
20 following two operations on target expression representations can be defined: 

a masking operation which takes an expression f and returns an expression r* such that, for every assignment, 
the significant bit-strings of f and f are the same, and that f has the masked property. 

25 a sign-extending operation which takes an expression f and returns an expression f such that, for every assign- 
ment, the significant bit-strings of f and f are the same, and that f has the sign-extending property. 

[0055] Given an expression f, we use the notation m(f) and s(f) to represent the expression t applied to the masking 
operation and the sign-extending operation, respectively. We also define the 'make-faithful' operation f(f) which given 

30 an expression f returns m(r) if f is unsigned, and returns s(f) if f is signed. 

[0056] We assume that the evaluation of the target expressions represented by m(f) and s(f). may be more 
time/space expensive than the evaluation of the target expression represented by f and that, therefore, the translation 
of source integer expressions into target integer expressions should try to minimise the applications of these opera- 
tions, rather than applying them naively to every sub-expression that is translated. 

35 [0057] In certain cases, the implementation of a value correcting operation can be such that the time/space cost of 
evaluating some particular expression is the same as the evaluation of the value corrected expression. This is certainly 
the case for an already value corrected expression, since the operation can simply return the original expression. It is 
also the case for a constant expression, since it can be value corrected by correcting the constant value directly and 
thus returning another constant expression (of the same cost). An expression t is easily masked if the evaluation of m(f) 

40 has the same cost as the evaluation of t. Similarly, an expression t is easily sign-extended if the evaluation of s(r) has 
the same cost as the evaluation of f. 

[0058] When translating integer expressions involving side-effect free relational operators (such as < and <=) which 
return a boolean value (this is a representation for the two truth values: true and false), it may not be necessary to apply 
a value correcting operation to the operands. Instead, it is enough to apply a usually cheaper monotone operation on 
45 the operands which satisfies the following properties: 

* °Ps( s i » s n) computes a boolean value representing true if and only if opr(mn(fj ),....,mn(f„)) computes a boolean 

value representing true; 

* opsfs! ,....,s n ) computes a boolean value representing false if and only if opj(mn(fy ),....,mn(f„)) computes a boolean 
so value representing false; 

where mn(r) represents the monotonic operation applied to the target expression f, op s is an n-ary relational operation 
in the source language, opj is the corresponding n-ary relational operation in the target language, s 1( . -,s n are source 
expressions, and ff t n are their corresponding (not necessarily value corrected) target expressions. 
55 [0059] Different relational operators may require different monotonic operations. Also, there can be several monot- 
onic operations that are applicable to the same relational operator. 

[0060] We now describe the transformation of type correct source expressions into the internal representation of 
the target expressions. We first describe the general mechanism for translating a source expression into a target 
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expression, and then give a concrete illustration of this process on a set of arithmetic expressions involving several C- 
style operators. 

[0061] The transformation of the source expressions into target expressions depends on the different operators in 
the source language. The method uses the following properties of the operators in the source language: 

5 

Property 1 deals with the values of the unused bits of the result of the operation. In certain operations, such as +, 
- and *, the unused bits of the operands do not affect the significant bits of the result and the operands of such oper- 
ands are not applied to the value correcting operations when the source expressions are translated into target 
expressions, if the unused bits of an operand can affect the significant bits of the result, then this particular operand 
10 needs to be value corrected, or, in the case of relational operations, all operands need to be applied to a valid 
monotonia operation. 

Property 2 deals with the way the unused bits and significant bits of the operands affect the unused bits of the 
result of the operation. In certain operations, such as \ and %, the value of the result is guaranteed to be faithful, 
is and therefore the appropriate high-bit property can be set in the type information of the target expression. 

[0062] Basically, the method described here: 

• uses an extended data structure representing the arithmetic or logic expression which includes the high-bit values 
20 of the expression . 

♦ considers the above two properties to apply the value correcting functions to the operands of an operation only 
when this is required. 

[0063] These points illustrate that the present method differs from known methods in which the value-correcting 

25 functions are always applied to the result of the operation. 

[0064] For example, we can extend the data structure to include the masking and sign^extension extent of the 
expression, and from these properties the present method significantly reduces the number of value-correcting func- 
tions required during the translation. In the following we simply store the masked and sign-extended property in the 
data-structure of the expressions, and explain how these two properties are used effectively by the present method to 

30 generate efficient target expressions. If we consider the masking and sign-extension extent instead of the masked and 
sign-extended property, then the method will generate slightly more efficient expressions. For example, the sub-expres- 
sions of the following expression: 

( a (U5) & 3 (U5)) + 1 (US) 

35 

have the following masking extents: 

^US) & 3 (U5) 2 
1(U5) 1 

40 

and, by using the property of the +• operator that the masking extent of the result is 1 added to the maximum masking 
extent of its operands, then we deduce that the masking extent of the main expression is 1 +2=3. We therefore know that 
the main expression is masked, and hence faithful. As a result it never needs to be value corrected. If, instead of keep- 
ing track of the masking extent, we simply keep track of the masked property, then from the fact that the above two sub- 
45 expressions have the masked property, we cannot deduce that the main expression is also masked. This means that 
the expression may need to be value corrected by adding an additional masking operation. 

[0065] Given a source expression representation e of width a* the target expression representation is constructed 
as follows: 

so Step 1 A table associating high-bit properties with the variables in the source expression e is constructed. This 
table is called the high-bit property mapping and has the form: 



Variable 


High-bit properties 


U 0 


Po 
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(continued) 



Variable 


High-bit properties 


Uo 


Pi 




Pn 



where U 0 U n are al! the variables in the source expression e, and P 0 P n are their corresponding high-bit prop- 
erties which may be one of the following: masked, sign-extended, or none (that is three possible values). The high- 
bit properties combination where both the masked and the sign-extended properties are set is not allowed in this 
table. Several data structures can be used for the internal representation of the high-bit property mapping and 

appropriate ones include association lists and sorted binary trees rf the variables U 0 U n can be ordered. 

The high-bit property mapping does not affect the correctness of the translation mechanism and, for instance, 
one can use the trivial mapping which gives no high-bit properties to all variables. However one can develop heu- 
ristics which construct a high-bit property mapping depending on how (and how often) the variables are used in the 
source expressions, so that the target expressions constructed by this translation mechanism can be potentially 
more efficient. 

Step 2 The type information with the following data is constructed: 

• the same sign as the source expression; 

• the significant width co; 

• the representative width r is chosen to be the smallest native width larger than or equal to (o; 

• no high-bit properties are set (for the moment). 



This type information is denoted by i e . The type information i e is constructed from the type of e. 
For example, given a signed 5-bit expression and assuming that the native bit-widths of the target machine are 
16, 32, 48, etc., then the type information with the following data is constructed: 



sign: 


signed 


significant width 


5 


representative width 


16 


high-bit properties 


none 



Again, several basic data structures can be used for the internal representation of the above type information. 
An appropriate one is a record structure with a boolean field for the sign, two integer fields for the significant and 
representative widths, and two boolean fields for the high-bit properties (one for the masked property and the other 
for the sign-extended property). 

Step 3 The internal representation of the target expression is constructed recursively and the appropriate high-bit 
properties fields in the type information are set. The base case of this recursive method is when the source expres- 
sion e is an atomic expression; the recursive case is when the source expression e is a compound expression. 

base case: If e is an atomic expression, then a corresponding target atomic expression is constructed. 

1. If e is a source constant expression e s , then the target constant expression </,) e t is constructed, where 
the value of the constant e t is such that its interpretation is the same as that of e 5 , and the type information 
i t is the type information i e (which was constructed in step 2 above) with the high-bit properties set accord- 
ing to the value of the unused bits in e t . 



11 



EP 1 091 292 A2 



2. If e is a source variable expression, then a target variable expression is constructed. A unique target 
variable identifier is used for every source variable identifier. More formally, the mapping from source var- 
iable identifiers into target variable identifiers is a total injective mapping. The high-bit properties fields of 
the type information i e constructed in step 2 above are set according to the entry in the high-bit property 
mapping constructed in step 1 . If the high-bit property mapping entry for the source variable is the masked 
property, then the masked field is set. Similarly, rf the entry is the sign-extended property, then the sign- 
extended field is set If the entry states that no high-bit properties are assigned to the source variable, then 
no high-bit properties in the type information i e are set 

recursion: A source compound expression op (e 7 , e 2 , e n ) 31 as shown in Figure 6 is transformed into a 

target expression as follows: 

1 . The source operands e h e 2 e n are first translated 32 into target expressions t 1f f n using this 

method recursively. 

2. The target expression 24 is then constructed using the operations 33 of the target machine such that 

• The target sub-expressions used in an arithmetic operation are applied to the masking operation or 
the sign-extending operation if the values in the unused bits of the operands can affect the values of 
the significant bits of the result of the operation. 

• In the case of relational operations which simply compare the value of two or more expression (such 
as <), then ft is usually enough to apply a monotonic operation which uses the value of the significant 
bit-string only and returns a value which uses all the bits in the target integer value. An efficient exam- 
ple of this is the operation which, given an expression e, returns e « {r-w), where « is the left-shift 
operator, and r and w are the representative and significant widths of e. 

• When different operations (masking, sign-extending, monotonic operations) can be used on the inter- 
nal representation of the target operands, then the operation which results in the cheapest target 
expression is chosen. 

• The type information i t of the target expression is constructed from the type information i e (step 2 
above) by setting the masked high-bit property when it can be guaranteed that the value of the oper- 
ation is always masked; and simi tarty, the sign-extended high-bit property is set when it can be guar- 
anteed that the value of the operation is always sign-extended. This step depends on the second 
property of the arithmetic operator concerned as described hereinbefore. 

[0066] Again, there are several data structures which can be used to represent the target expression internally. An 
appropriate one is a tree structure where the leaf nodes represent atomic expressions and the non-leaf nodes repre- 
sent operators. 

[0067] Finally, as shown in Figures 3 and 5, the data representing the type information is stripped from the final 
internal representation of the target expression. 

[0068] In the following example, it is assumed that the source and target languages contain constants, variables 
and compound expressions involving the following operators: 

• The unary operators: !, ~ and - whose semantics are the same as their respective C unary operators as given in 
with the typing rule that the type of the operand of these operators should be the same as the type of the value of 
the operation. 

• The binary infix operators: +, -, *, /, %, &, I, a, &&, II, =, f=, <=, >=, <, >, «, », »„ and =, whose semantics are 
the same as their respective C binary operators with the exception that for simplicity shifts (« and ») by values 
greater than the bit-width of their first operand are undefined. We also assume the following typing rules: 

1 . the operands of all the operators with the exception of =, !=, <=, >=, <, >, «, » and, have the same type 
as that of the value they return; 

2. the operands of the relational operators =, !=, <=, >=, < and > have the same type; 

3. the first operands of « and » have the same type as the value they return; the second operands of « and 
» are unsigned; 

4. the second operand of, has the same type as the value it returns. 

• Conditionals of the form: 
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e 1 ?e 2 :e 3 

whose semantics is the same as C conditional expressions. We assume that the operands e 2 and e 3 have the 
same type as the value returned by the conditional expression. 

5 

• Typecasts of the form: 

(type)e, 

10 where type has the form signed w or unsigned w which gives a new sign (sign or unsigned) and width (w) to the 
operand e/. The value of this operation is sign-extended automatically if the operand e 1 is signed and narrower 
than w. 

[0069] Given the above operators, simple masking and sign-extending operations can be implemented as follows: 

15 

masking Given an expression (i t )t, if the type information i t has the masked property, then this operation simply 
returns (i$t Otherwise, if the significant width of the expression is w, and its representative width is r, then we are 
required to mask the value of the expression with the r bit integer whose lowest w bits are 1 and whose highest r - 
w bits are 0. Let us call this integer m. The type information i m associated with m has the same sign as / fr the sig- 
20 nificant width r and the representative width r. The required expression is therefore: 

il')W)t & ii^m) 

where i' t is the type information resulting from setting the masked property to i t and unsetting its sign-extended 
25 property. Given an expression e, we denote the application of the masking operation by m(e). 

sign extending Given an expression^)/, if the type information i t has the sign-extended property then this opera- 
tion simply returns (j)t. Otherwise, if the significant width of the expression is w, and its representative width is r, 
then we can sign-extend the value of t by left-shifting by r - w places, and then applying an arithmetic right-shift by 
30 r-w places. (Note that an arithmetic right-shift preserves the highest bit of its operand.) Let us denote the constant 
r - iv by u. The type information i u associated with u is unsigned, has the significant width r and the representative 
width r. If /, is signed, then the required expression is: 

C7(«fV«/;' « Oju)) » (iju) 

35 

where i' t is the type information /, made precise by setting the significant field to r, and i" t is the type information 
constructed from i t by setting the sign-extended high-bit property, and unsetting the masked property, 
if i t is unsigned, then the sub-expression: 

40 «'V«'f>' « </'„>")) 

is type cast by signed r before it is right-shifted (in order to achieve an arithmetic right-shift, rather than a logical one), 
and the final expression is then type cast by unsigned r to retain 
its sign. 

45 Given an expression e, we denote the application of the sign-extending operation by m(e). 

[0070] We recall the definition of f (e) as m(e) if the expression e is unsigned, otherwise as s(e) if e is signed. We 
also define the following monotonic operation required during the translation of the relational operations involving =, 
!=, <, <=, > and >=: 

50 

monotonic operation Given an expression (/,>f with significant width w and representative width r, this operation 
returns: 

(/Vtt/^f « iiju) 

55 

where u is r - w t and the type information i' t has the same sign as i p the significant width r and representative width 
r. The type information i u is unsigned, has the significant width rand representative width r. Given an expression 
e, we denote the application of this monotonic operation by mn(e). 
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[0071 ] More efficient implementations of the masking, sign-extending and the above monotonic operations are pos- 
sible, although the above implementations are sufficient for the correctness of the translation mechanism described 
herein. 

[0072] Such more efficient implementations use the technique of applying optimising transformations to improve 
5 the efficiency of the expressions. The following are transformations that can be applied on the operations: 
[0073] A masked operation e of the form t and m can be optimised as follows: 

• If t is a constant k, then the value of e is calculated to say, I, and e is replaced by I. 

• ff t is of the form x & k or k & x where k is a constant, then the value of k and m is calculated to say, I, and e is 
10 replaced by t& I. 

• If t is of the form x & y where either (or both) x & m or y & rn can be optimised to say, x* and y", then e is replaced 
by x 1 & y\ Similar rules apply for operations of the form x I y, x A y, z ? x : y. 

[0074] A sign-extended operation e of the form (t«u) »u can be optimised as follows: 

15 

• If t is a constant k, then the value of e is calculated to say, I, and e is replaced by I. 

• If t is of the form x«k where k is a constant, then the value of k + u is calculated to say, I, and e is replaced by (t«l) 
»u. 

• ff t is of the form x & y where either (or both) (x«u)»u or (y«u)»u can be optimised to say, x" and y\ then e is 
20 replaced by x* & y\ Similar rules apply for operations of the form x I y, x A y, z ? x : y. 

[0075] A monotonic operation e of the form t«u can be optimised as follows: 

• If t is a constant k, then the value of e is calculated to say, I, and e is replaced by I. 

25 • If t is of the form x«k where k is a constant, then the value of k + u is calculated to say, I, and e is replaced by t«l. 

• ff t is of the form x & y where either (or both) x«u or y«u can be optimised to say, x 1 and y\ then e is replaced by 
x 1 & y\ Similar rules apply for operations of the form x I y, x A y, z ? x : y. 

[0076] We may sometimes require to choose the most efficient expression from a number of expressions. Given a 

30 number of expressions e 1 e n , we use the term the cheapest of e 1 e„" to denote the expression in e ; e n that 

is likely to be computed most efficiently. We also define this term on pairs of expressions: the cheapest of [e 1 ,e' 1 ),.... t 

(e^e'n) 9 is the pair of expressions in (e,e*) in (e f ,e' 7 ) (e^e'J such that the computation of e followed by that of e' 

is likely to be the most efficient A good measure of the efficiency of a target expression is the number of operators in 
its internal representation: 
35 the cheapest expression is the one with the least number of operators. 

[0077] We say that some type information is the type information i 2 made faithful if is constructed from i 2 by 
setting the masked high-bit property if i 2 is unsigned, and by setting the sign-extended high-bit property if i 2 is signed. 
[0078] We now describe the method for translating source expressions into target expressions: 

40 • Constants and variables in the source language are transformed into target constants and variables as explained 
earlier. 

• Given a unary operation e of the form: 

45 op(eJ 

we first construct the type information i t from the type of e (i t has no high-bit properties), and then transform the 
operand recursively into f 7 . The expression f 7 is annotated with its typing information since it is represented by 
the internal representation described hereinbefore. The required target expression: 

so 

('"VoPCi) 

is then constructed, where if the operator op is: 

55 I: the expression f f needs to be masked or sign-extended since the value of the unused bits of f 1 may affect 
the significant bit-string in the result of the ! operation. Therefore the expressions m(^) and s(f 7 ) are con- 
structed and f 1 is chosen to be the cheaper of the two. The type information /*, is the type information i t 
made faithful. 
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~: the required operand f 7 is simply f t since the unused bits in f 7 do not affect the significant bit-string in the 
result of the ~ operation. The type information i' t is constructed from /, by setting the sign-extended property 
if and only if t 1 is sign-extended. 

-: the required operand f t is 1 1 and the type information i' t is i t since the value of the arithmetic negation oper- 
ation - cannot be guaranteed to be always masked and/or sign-extended. 

Given a binary operation e of the form: 

e 1 op e 2 

we first construct the type information i t from the type of e, and transform the operands e 1 and e 2 into f 7 and 
respectively. The required target expression is: 

</V(f t opf' 2 ; 

where if the operator op is: 

+,-,*: the operands t 1 and t 2 are f 7 and t 2 , respectively. The type information i' t is i t 

A % : the operands f 1 and f 2 are f(f n ) and f(/y . The type information i' t is i t made faithful. 

&&, It : the operand V 1 is the cheapest of m^), s(r^; and the operand f 2 is the cheapest of m(t 2 ), s(fy. The 
type information i) is i t made faithful. 

& : the operands t 1 and f 2 are f 7 and t 2 . The type information i' t is constructed from i t by setting the masked 
property if either t 1 or t 2 is masked, and then setting the sign-extended property if both t 1 and t 2 are sign- 
extended. 

I, A : the operands f 1 and f 2 are f t and t 2 . The type information P t is constructed from // by setting the masked 
property if both t 1 and t 2 are masked, and then setting the sign-extended property if both t 1 and t 2 are sign- 
extended. 

=, != : the pair of operands (f 1t fy is the cheapest of (m(f^, m(f 2 )), (s(f y ), s(f 2 )), (mn(fj), mn(f 2 )). The type 
information P t is i t made faithful. 

<=, >=, <, > : the pair of operands {f 1t ry is the cheapest of (f(f^,f(f 2 )), (mn(f ? ), mn(f 2 )). The type information 
i t is /, made faithful. 

« : the first operand r > 1 is the expression fj, and the second operand t 2 ^ Kh)- The tyP e information r" t is / t . 

» : the operands r* 7 and f 2 are f(f r j and f(f2), respectivety. The type information /" t is / t made faithful. 

= : the first operand f A is simply the expression f v The second operand t 2 is m(f 2 ) if f^ is masked, s(f 2 ) if 
is sign-extended, or otherwise it is t 2 . The type information i\ is constructed from i t by setting the masked prop- 
erty if f 2 is masked, and then setting the sign-extended property if f 2 is sign-extended. 

, : the operands f y and f 2 are fj and t 2 . The type information i' t is constructed from i t by setting the masked 
property if t 2 is masked, and then setting the sign-extended property if t 2 is sign-extended. 

Given a conditional expression e of the form: 

e,?e 2 :e 3 

we first construct the type information i t from the type of e, and then transform the operands e n , e 2 and e 3 into f 1( 
t 2 and f 3 , respectively. The required target expression is: 

OVC, ?t 2 :t 3 ) 
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where is the cheapest of m(f 7 J, s(tj). The type information i' t is constructed from i t by setting the masked prop- 
erty if both t 2 and t 3 are masked and then setting the sign-extended property if both t 2 and t 3 are sign-extended. 

* A type cast expression e of the form: 

(fype s ) e A 

is transformed into the required target expression as shown below: 

1 . The type information i t is constructed from the type type^ Let us define s t and r t as the significant width and 
representative width in l b respectively. Let us define type s as the type of the required target expression; that 
is, type, has width r t and the sign of type s . 

2. The operand e 1 is transformed into the expression f 7 . Let us define s, and r f as the significant width and 
representative width of t 1> respectively. 

3. If Sf is the same as s 1t then the target expression is: 

</f ')(fype,) f , 

where is constructed from i t by setting the masked property if t f is masked and then setting the sign- 
extending property if 1 1 is sign-extended. 

4. If s t is smaller than s, then the required target expression is: 

(i^{type t )t 1 

5. If s t is greater than s 1 then: 

(a) If r> is the same as r f then the target expression is: 

{^ t1)) )(type t )f(t,) 



The type information i[WV) j S constructed from i t by setting the masked property rf f (trf is masked, and 
then setting the sign-extending property if f(f 7 ; is sign-extended. This does not make the type information 
i^HW) faithful since the sign of f(f ^ may be different from that in type t 
(b) If r t is greater than r 1 then the target expression is: 

where is /; made faithful. 
[0079] We now illustrate this method by translating the following source expression 

(( a (U5) + D (US)) * C (U5)) I ( d (U5) + e (U5)) 

into a target expression that can be simulated by a 1 6 bit architecture. 
[0080] The given source expression is already type correct 

Step 1 : The trivial high-bit property mapping is constructed which associates every variable in the source expres- 
sion with the "no high-bit properties" entry. 

Step 2: The type information data structure is contructed. Since the type of the source expression is unsignedS and 
the target architecture is a 16-bit one, the following type information is constructed: 
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sign: 


unsigned 


significant width: 


5 


Representative width: 


16 


High-bit properties 


none 



10 



15 



20 



25 



Since ail sub-expressions in the given source expression have the same type, then the above data is con- 
structed for each sub-expression during the recursive part of this method, in the following we use the notation: 

(high - bit properties) expression 

to denote a given target expression annotated with some particular high-bit properties. 

Step 3:The target source expression is constructed recursively as follows: 

1 . From the source expression s: 

W a (U5) + b (U5)) * C (U5)) / ( d (U5) + e (U5)) 

the two operands: 

S o = W a (U5) + D (U5)) * C (U5)) 



40 



50 



{6 



(US)" 



S (U5)) 



are translated recursively into t 0 and fj as follows: 

(a) From the source expression s 0 , the two operands: 



S 00 _a (U5) + D (U5) 



S 01 ~ C (U5) 



are translated recursively into f w and t 01 as follows: 
i. Rom the source expression Sqq, the two operands: 

S 000 = a (U5) 
S 00O = b (U5) 

are translated recursively into fax) and fooo by simply constructing a unique target variable for each 
source variable and thus giving: 

s 000 = <none>a (U5) 
s ooo = < none > b (U5) 



55 



The high-bit property mapping constructed in step 2 does not assign any high-bit properties to the 
above variables, and therefore none are set. 

ii. Since the operator concerned in Sqq is +, then no value correcting operators need to be applied on 
the two operands fooo anc ' fa>i> and therefore the following target expression foo ' s constructed: 
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^00 = < none > a (U16) + b {U16) 

No high-bit properties can be guaranteed and therefore none are set. 
in. The source expression s 01 is translated recursively into f 01 by simply constructing a unique target 
variable for the source variable and thus giving: 

f 01 =<none)c (U16) 

The high-bit property mapping constructed in step 2 does not assign any high-bit properties to the 
above variable, and therefore none are set 

(b) Since the operator concerned in s 0 is *, then no value correcting operators need to be applied on the 
two operands and f 01 , and therefore the following target expression t Q is constructed: 

f 0 = <none) ((a (U16) + b^) * c (U16) ) 

No high-bit properties can be guaranteed and therefore none are set. 

(c) From the source expression s 1( the two operands: 

5 10 = d (U5) 

5 11 = e (U5) 

are translated recursively into f 10 and t n by simply constructing a unique target variable for each source 
variable and thus giving: 

f 10 = <none>d (U16) 
f in = <none>e (U16) 

The high-bit property mapping constructed in step 2 does not assign any high-bit properties to the 
above variables, and therefore none are set. 

(d) Since the operator concerned in is +, then no value correcting operators need to be applied on the 
two operands f 10 and f 11? and therefore the following target expression f 1m is constructed: 

f t =<none> (d (U5) + c m ) 

No high-bit properties can be guaranteed and therefore none are set. 

2. Since the operator concerned in s is /, then the target expressions f 0 and fj are applied to the masking value 
correcting operation giving: 

F 0 = (masked) ((a ^ U16 j + b( U16 j) * c (ui6)) * 31 (Ui6) 
= (masked) (d (U16) + e^) & 31 ^ 

and the target expression 
t = (none) 

((( a (U16) + D (U16)) * C (U16)) & 31 (U16)) / (( d (U!6) + e (U16)) & 31 (U16)) 
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is constructed Since the masked property can be guaranteed in the result of the unsigned operation, then the 
masked high-bit property of is set giving the following final internal representation of f the target expression: 

(masked) 

5 

(M a (U16) + D (U16)) * C (U16)) & 31 (U16)) ' (( d (Ul6) + e (U16)) & 31 (U16)) 

The last stage is to strip the additional type information from the above expression, thus giving: 

w 

(W a (U16) + D (U16)) * C (U16)) & 31 (U16)) ' (( d (U16) + e (U16)) & 31 (U16)) 

[0081] This technique may be used in the efficient simulation of high-level descriptions of arithmetic circuits on a 
specified target machine. Figure 7 illustrates an example of a Bach hardware design flow in which hardware is 

15 described in the Bach high-level language. The hardware description is implemented at 40 and results in Bach source 
code. A step 41 simulates the Bach hardware description and makes use of the method described hereinbefore to pro- 
vide a translation of operations or expressions which are to be implemented in the arithmetic and/or logic circuits of the 
target machine. The simulation result is checked for correctness at 42. If incorrect, a fresh implementation is performed 
in the step 40. rf the simulation result is correct, a step 43 generates a low-level hardware description which is then con- 

20 verted using known or appropriate techniques to a silicon chip 44. 

Claims 

1 . A method of translating a source operation on at least one source operand comprising a binary integer of a first bit- 
25 width to a corresponding target operation for evaluation by a processor which performs integer operations on binary 

integers of a second bit-width which is greater than the first bit-width, the method comprising: translating (2) the 
source operation to a target operation having at least one target operand; identifying (3, 5) whether the value of 
unused bits of the or each target operand affects the value of the target operation and whether the target operand 
or any of the target operands is capable of having one or more unused bits of inappropriate value; and, if so, adding 
30 (6) to the target operation a correcting operation for correcting the value of each of the one or more bits of inappro- 
priate value before performing the target operation. 

2. A method as claimed in claim 1 , characterised in that the at least one source operand comprises at least one con- 
stant 

35 

3. A method as claimed in claim 1 or 2, characterised in that the at least one source operand comprises at least one 
variable. 

4. A method as claimed in any one of the preceding claims, characterised in that the at least one source operand com- 
40 prises at least one sub-operation. 

5. A method as claimed in any one of the preceding claims, characterised in that the source operation comprises an 
arithmetic operation. 

45 6. A method as claimed in claim 5, characterised in that the identifying step comprises identifying an unsigned 
unmasked target operand and the correcting step comprises a masking step. 

7. A method as claimed in claim 6, characterised in that the first bit-width is w bits and the masking step comprises 
performing a bit-wise AND operation between the unsigned unmasked target operand and a binary representation 

so of(2 w -1). 

8. A method as claimed in arty one of claims 5 to 7, characterised in that the identifying step comprises identifying a 
signed non-sign-extended target operand and the correcting step comprises a sign-extending step. 

55 9. A method as claimed in any one of claims 1 to 4, characterised in that the source operation comprises a relational 
operation and the correcting step comprises a monotonic operation. 

10. A computer program for performing a method as claimed in any one of the preceding claims. 
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11. A storage medium containing a computer program as claimed in claim 10. 

12. A computer programmed by a program as claimed in claim 1 0. 

13. A translation produced by a method as claimed in any one of claims 1 to 9. 
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